Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sim2sim for stompymicro #129

Closed

Conversation

henri123lemoine
Copy link
Contributor

Summary:

  • Added base Env and base MujocoEnv environments
  • Added ~working (meh) sim2sim, robot sometimes stands for quite a while. More tests/param opt is needed
  • Added Bayesian parameter optimizer for sim2sim. The idea, in short, is to initialize parameters you're unsure about, and have the optimizer run a bunch of sims for ~randomly selected params, try find a pattern through the noise, test it, and iterate (until you have params that do well according to your reward function). Should be portable over to other envs with minor modifs.
  • Restructured sim2sim-related files
  • Added drawing of vectors for direction of command & empirical direction of robot to compare for play.py
  • Improved(?) arg-parsing
  • Added command manager
  • Added policies of some successful experiments
  • Added auto-config-saving at start of training runs
  • Added arms to stompymicro robot

Notes: will need to clean up tomorrow, revert some changes, uncommit some crimes

@henri123lemoine
Copy link
Contributor Author

henri123lemoine commented Dec 9, 2024

Still a few confusions:

  • Does this look right? @budzianowski I chose values from previous sim2sim attempts that I was told work, but I have not independently confirmed that.
  <default>
    <joint limited="true" damping="0.53" armature="0.008793405204572328" frictionloss="0.001"/>
    <geom condim="4" contype="1" conaffinity="15" friction="0.9 0.2 0.2" solref="0.001 2"/>
    <motor ctrllimited="true"/>
    <equality solref="0.001 2"/>
    <default class="visualgeom">
      <geom material="visualgeom" condim="1" contype="0" conaffinity="0"/>
    </default>
  </default>

Also curious what are all the parameters worth modifying, if the list is reasonably short?

  • For a bit I was investigating kds and kps parameters, what impact they have on the standing behavior, etc. But I just found out that the mujoco policy that uses ONNXModel("kinfer_test.onnx") generated from examples/experiments/standing/robustv1/policy_1.pt does not produce the same behavior as the policy produced through make_alg_runner. I still don't know why. I'm guessing I'm not giving in the inputs properly?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants